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APPLICATION  OF  THE  UNITERM  SYSTB*  AND  THE 
ASSOCIATION  OF  IDEAS  TO  A  SPECIAL  LIBRARY  FILE1 

TECHNICAL  REPORT  NO.  10 

Perhaps  the  best  way  to  Indicate  the  depth  of  a  detailed  analy¬ 
sis  possible  with  a  Uniterm  System  as  contrasted  with  systems  that  make 
a  card  or  cards  for  each  item  is  to  compare  the  1148  punching  positions 
in  the  total  system  presently  employed  by  the  Prevention  of  Deteriora¬ 
tion  Center  with  the  fact  that  for  the  first  five  hundred  reports  we  in¬ 
dexed,  out  of  the  same  group  of  reports  indexed  by  FDC,  we  generated  a 
Uniterm  vocabulary  of  approximately  3000  Uniiterms,  including  names  of 
specific  materials,  organisms,  chemicals,  etc.  Our  average  assignment  of 
Uniterms  per  item  was  25.  We  were  also  able  to  index  in  a  few  months 
more  than  the  average  annuai  load  of  the  Prevention  of  Deterioriation 
Center.  On  the  matter  of  the  adequacy  of  our  methods  for  meeting  the 
reference  needs  of  the  Center,  the  Office  of  Naval  Research  recently  pre¬ 
pared  a  series  of  experimental  questions,  and  by  using  this  method,  was 
able  to  get  satisfactory  answers.  But  it  is  possible  to  indicate  cer¬ 
tain  theoretical  problems  that  must  attend  the  use  of  the  Uniterm  system, 
and  which  we  hope  to  solve  by  employing  a  new  technique  of  retrieval 
by  association. 


This  report  was  prepared  as  part  of  a  joint  paper  with  Dr.  Curtis  L.  Brown 
Prevention  of  Deterioriation  Center,  National  Research  Council,  entitled, 
"Two  Approaches  to  the  Retrieval  of  Information  from  a  Special  Library." 
Dr.  Brown's  paper,  "Application  of  a  Simple  F\inched  Card  System  to  a 
Special  Information  Service,"  and  this  paper,  are  to  be  read  before  the 
Division  of  Chemical  Literature,  American  Chemical  Society  Meeting, 
Minneapolis,  Minn.,  September,  1955. 


The  Uniterm  system,  like  all  other  systems  of  retrieval « 
operates  with  words.  And  slhee  this  operation  Is  frank  and  un¬ 
ashamed*  the  Uniterm  system  is  often  oompared  unfavorably  with  other 
systems  which  presumably  use  "ideas’*  or  "subjects*  or  "semantio  con¬ 
tent"  rather  than  words  as  instruments  of  storing  and  retrieving 
information.  1  will  grant  that  in  love  and  empathy  there  is  commu¬ 
nication,  and  information  may  be  sent  in  a  wink  and  received  in  the 
viscera  without  the  mediation  of  language*  but  t  think  that  you  must 
likewise  grant  that  all  formal  communication*  all  formal  storage  of 
information,  must  communicate  and  store  Symbols  and  not  meanings, 
even  though  our  purpose  is  always  to  communicate  meanings  and 
semantio  content* 

It  is  certainly  the  beginning  of  wisdom  to  recognize  that 
words  are  slippery  things  and  that  we  cannot  make  an  easy  one  to  one 
identification  of  symbol  and  meaning*  The  same  symbol  can  mean  dif¬ 
ferent  things  as  with  homonyms*  But  surely  this  elementary  insight 
is  not  all  that  is  meant  by  those  who  contrast  words  and  meanings 
in  indexing  systems.  To  use  "wrench"  rather  than  "spanner'*  to  stand 
for  a  certain  type  of  physical  object  or  to  use  "tube"  rather  than 
"valve"  for  another  is  certainly  to  choose  a  word  but  it  is  not  to 
dispense  with  words  and  confront  the  searcher  with  meanings  unmediated 
by  symbols.  For  let  us  be  quite  clear  about  this  —  and  I  wish  X 
could  say  It  as  wittily  as  Fa  if  t  home  did  In  a  recent  paper  before 


the  joint  AGARD-SLA  mooting  on  Documentation  —  communication  systems 
and  stotago  systems  deal  with  marks,  noises.  Shapes,  sounds,  patterns, 
etc.  and  not  with  the  semantic  content  of  these  things.  Undoubtedly, 
the  miracle  of  communication  is  that  we  can  get  semantic  content  from 
one  mind  to  another  by  manipulating  shapes  and  noises.  Let  us  not 
denigrate  this  miracle  by  asking  chemists  and  librarians  to  distrust 
it  and  to  talk  to  one  another  only  in  tho  unmediated  meanings  of  love 
and  intuition. 

One  of  the  ways  to  make  language  more  precise  and  a  better 
conveyor  of  meanings  is  to  uso  names  rather  than  descriptions.  The 
distinction  botween  names  and  descriptions  is  an  Important  and  abstruse 
topic  of  modern  logic  and  we  need  to  do  little  more  here  than  Indlcato 
the  difference.  When  I  say  "x  is  carnivorous",  I  am  describing  "x". 

When  X  say  "x  Is  a  carnivora",  I  am  naming  "x".  Now  a  description  may 
be  false  while  a  name  Is  never  false.  If  I  say  "the  man  named  Mortimer 
Taube  is  reading  this  paper",  this  is  an  attribution  of  a  symbol  which 
is  not  significantly  true  or  false.  But  if  X  say  the  man  reading  this 
paper  is  wise  or  a  fool,  I  hope  X  can  get  a  significant  argument  on  tho 
truth  or  falsity  of.  one  statement  or  the  other. 

Now  there  aro  certain  sciences  in  which  the  activity  of 
assigning  names  plays  a  very  important  role.  In  these  sciencos  if 
everything  has  only  one  name  and  these  names  are  used  and  known  uni* 
vers ally  than  wo  can  build  orderly  communication  and  storage  systems 
using  as  symbols  only  tho  agreed-upoa  names.  Especially  is  this  true 
if  tho  names  are  used  in  a  fixed  pattern,  e.g.,  the  Cabots  being  divided 


Into  the  Boston  Cabots  and  the  New  York  Cabots*  A  description  can  be* 
come  a  name  when  its  attribution  to  an  entity  becomes  a  question  of 
definition  and  not  of  fact..  Thus,  of  alt  systems  of  communication  and 
storage,  taxonomies  are  the  most  purely  verbal  or  symbolic  and  th© 
least  concerned  with  the  Interplay  of  symbol  and  meaning  or  the  seman¬ 
tic  content  of  the  symbols* 

A  languago  which  has  been  frozen  into  a  taxonomy  by  the 
conversion  of  descriptions  Into  nomas  becomes  a  mold  into  which  wo  may 
forco  tho  facts  and  ideas  which  are  the  concern  of  any  science;  and  it 
is  tho  usual  fate  of  taxonomies  to  bo  discarded  en  toto  when  as  arti- 
ficial  things  they  become  too  far  divorced  from  reality,  rather  than 
to  grow  and  change  as  a  living  language  grows  and  changes  as  it  attempts 
to  symbolize  ever  richer  conglomerates  or  systems  of  facts  and  experience* 

But  living  languages  are  slippery  things  —  they  refleet  the 
meanings  of  days  On  which  they  are  used  and  tho  different  people  who  use 
them  —  how  then  con  we  use  o  living  language  not  only  for  poetry  and 
song,  but  for  science  and  the  storage  of  information  which  is  the  physi¬ 
cal  memory  of  science.  Is  thoro  a  third  possibility  which  does  not 
commit  us  to  the  fixed  and  ultimately  futile  patterns  of  a  taxonomy  or 
artificial  language  and  yet  which  lets  us  make  an  orderly  use  of  the 
rich  diversity  of  living  language.  The  answer  I  think  is  to  bo  found 
in  what,  following  Vannevar  Bush,  we  have  called  retrieval  by  associa¬ 
tion  rather  than  by  fixed  hierarchical  patterns.  And  hero  once  more 
I  must  quote  another  roan  who  has  expressed  the  saroo  thought,  independently 
arrived  at,  in  different  words t 


"In  r%  opinion,  the  problem  of  bringing  pertinent 
vocabularies  Into  coincidence  is  tin  crucial  On© 
for  ell  documentation  systems  whether  operated  by 
classifications,  indexes*  os  md'Ams.  tin  cyetem 
rust  display  its  trombu'lery  for  selection.  For  the 
field  of  clioTiistry,  this  vocabulary  is  so  enormous 
and  is  crowing  so  rapidly  that  it  Is  itspoosifele  be- 
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•election*'  indoTses  give  users  cross  refere aeet  end 
display  suggestive  and  now  terms  In  the  ctodifico- 
time  under  headings.  fbo  searchers  aid  in  bring¬ 
ing  their  vocabularies  into  coincidence  by  coarse* 
of  Study,  textbooks,  reftronca  twfe*  review, 
»aw>g»opfes*  dictionaries,  current  literature,  in¬ 
dexes,  and  the  life.  At.  ittoaoat.  the,  .process,  of 
hylwtlnn  .yecffial  arise  into,  .oolncf.fkmaa  .lor feKwj* 
imsiiLisa^^ 

The  important*  point  to  note  is  that  nil  dqsu^mta* 
tioa  oystcaa,  including  cachcnlaod  ones*  ore  itoed 
®  tais  problem  of  bringing,  pertinent-  vocabular¬ 
ies  into  coincidence  nod  that  If  coincidence  is  net 


effected  then  significant  iafom&tios  is  certain  to 
be  missed,  difficulties  with  bread  generic  searches, 
formerly  attributed  by  r,zm  of  us  Interested  in 
wschnaistitiea  solely  to  indexes*  era,  in  reality* 
cnased  by  the  difficulties  of  bringing  vocabularies 
into  coincidence."  2 


this  boors  that  the  alternative  to  a  taxonomy  for  0  broad 
generic  search  Is  a  »ecl»nlaatd  system  of  bringing  vocabularies  into 
coincidence,  or  us  we  have  put  it,  displaying  the  associations  in  the 

system. 


i  have  many  time*  told  the  story  of  the  early  days  of  the 
Bailors  system  wfes  wo  tried  to  demonstrate  its  method  by  ashing  visitors 
to  our  offices  to  pick  any  two  cards  raid  find  the  number  common  to  both. 


^Charles  Bernier*  ’Organizing  Abstract  information,"  unpublished  paper 
road  before  American  Documentation  Institute,  fevombor  6,  1933* 


The  cards  would  be  selected  and  no  common  numbers  would  bo  found. 
Slightly  embarassed,  we  would  suggest  another  two  cards*  many  times 
with  the  some  result.  To  avoid  this  situation  we  prepared  a  "canned 
sample"  of  a  set  of  cards  having  numbers  in  conation,  and  we  gave  this 
set  to  all  casual  visitors.  What  we  had  discovered,  of  course,  was 
the  lack  of  suggostiveness  or  coincidence  of  vocabulary  in  a  Uniterm 
system.  If  you  ask  a  Uniterm  system  what  It  has  on  "A"  and  "B",  it 
will  give  you  an  answer  but  if  you  ask  the  system  to  suggest  terms 
which  define  a  subject  together  with  "A",  the  system  remains  mute  and 
helpless.  And  this  situation  of  wondering  what  is  the  right  question 
to  put  to  the  system  is  more  difficult,  the  richer,  the  more  flexible 
and  more  detailed  the  vocabulary  of  the  system.  Think  of  the  situation 
os  it  might  face  the  user  of  a  Uniterm  system  in  the  PDC  —  with  its 
concern  for  "thousands  of  distinctly  different  materials  such  as  alloys, 
elastomers,  and  fibrous  products,  each  available  In  a  multitude  of 
forms  and  formulations.  .  .  /and7  an  equal  If  not  greater  variety  of 
biological  organisms  and  chemical  agents  which  have  the  unpleasant  habit 
of  attacking  materials  and  structures.  .  .  ^and7  a  legion  of  accepted 
and  experimental  pesticides,  oxidation  and  corrosion  inhibitors,  fire 
and  waterproofing  treatments,  and  other  special-purpose  preservatives, 
for  which  the  systematic  nomenclature  alone  is  enough  to  drive  on  in¬ 
dexer  or  coder  crazy."  I  do  not  approach  this  system  as  I  might  if 
making  a  patent  search,  to  inquire  whether  anyone  may  utilize  a  certain 
composition  for  a  certain  purpose.  Rather,  I  may  want  to  knew  all  the 
compositions  that  have  been  used  for  a  certain  purpose  and  I*m  unable 


• 


to  think  of  their  names.  Or  1  want  *11  th*  use*  to  which  a  certain 
composition  has  boon  put  and  I’m  unable  to  think  of  til  the  terms  which 
should  bo  searched.  In  ordinary  affairs  our  memories  help  us  to  re¬ 
call  the  symbols  of  meanings  wo  wish  to  cewufticate*  but  when  I 
Interrogate  a  system  which  1s  not  a  creation  of  my  own  experience* 
my  memory  Cannot  help  me  to  know  the  limitations  or  the  richness  of 
the  symbols  under  which  information  has  been  stored.  The  systosa 
must  have  its  own  memory  which  it  can  display  to  the  searcher  on 
demand.  Thus,  If  we  are  concerned  as  in  the  example  given  with 
various  types  of  paint*  that  have  been  tested  under  certain  physical 
conditions,  a  physical  memory  system  must  associate  all  types  of 
paints  or  coatings  with  a  specified  physical  condition  under  which 
they  have  been  tested..  Such  it  memory  device  is  entirely  feasible 
and  wo  have  built  a  working  model  in  which  we  have  entered  certain 
of  the  terras  of  our  vocabulary. 

One  thing  we  have  learned  during  the  past  year  if  that  it 
if  virtually  impossible  to  make  a  manual  dictionary  of  associations. 
The  complexity  of  memory  and  associative  trails  grows  so  rapidly 
that  only  a  system  of  mechanical,  optical  or  electronic  channels 
or  relays  can  perform  tha  proper  manipulations  and  present  the  proper 
disclosures.  Wc  have  employed  a  combination  of  punched  holes  and 
electrical  circuits,  in  what  is  basically  a  very  simple  system.  On 
a  card  representing  any  term  in  the  system  we  will  punch  a  hole  for 
each  term  associated  with  tho  term  heading  the  card.  By  making 
electrical  contact  through  such  holes  we  can  light  up  on  a  screen 


all  the  words  associated  with  any  given  word*  If  from  the  words 
displayed  we  Select  a  torn  of  Interest  to  us  wo  can  superimpose 
the  card  for  this  term  on  the  card  for  the  original  term.  This 
will  black  out  a  certain  number  of  terms  on  the  display  which 
arc  not  associated  with  both  terms  of  the  ({uestion,  and  so  on 
through  associations  of  three*  four,  and  more  terms.  The  more  terra 
Cards  we  superimpose  on  ono  another*  the  tighter  and  more  condensed 
will  be  our  network  of  associations. 

The  importance  of  suggestiveness  or  a  mechanism  of 
associations  in  an  information  system  varies  from  field  to  field* 

In  a  theoretical  science  where  connections  are  theoretically  p re* 
dictabio,  we  do  not  need  such  a  mechanism.  Wo  do  not  have  to  be 
reminded  of  the  "association"  of  mass  and  energy.  But  a  memory 
that  tell  us  that  certain  paints  resist  certain  organisms  and  that 
the  system  ooatalns  positive  information  on  this  fact  must  enhance 
the  retrieval  potentialities  of  the  system. 

Since  this  is  a  symposium  on  punch  cards,  one  final  word 
Is  in  order  on  the  possible  mechanisms  which  can  be  used  for  systems 
of  association.  In  a  previous  paper  wo  expressed  tho  conclusion  that 
since  on  association  system  was  concerned  with  terms  and  ideas  rather 
than  with  items,  the  Uniterm  or  Batten  system  of  a  card  per  term  or 
idea  was  a  much  more  suitable  base  than  a  punched  card  machine  employ¬ 
ing  a  card  or  cards  per  item.  We  have  since  been  informed  that  the 
more  elaborate  IBf.l  machines  like  the  701  can  manipulate  associations 
and  can  in  effect  superimpose  onepattern  of  associations  on  another 


and  point  out  the  common  associations*  I  don't  know  exactly  how 
this  is  done*  but  I  would  encourage  anyone  with  a  701  machine 
available  to  him  to  give  it  a  try. 


Respectfully  submitted, 


'  /  ./j  /  /l/t/l  ) 

'  s/i’/ fs/So:  I  ■  / t/J ■'>./\ 


Mortimer  Taube 
President 
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